D.20 STATISTICA

Approximate Cost: $1,195

Source:  StatSoft (www.statsoft.com)

Current Version: STATISTICA 12

Operating System Needs: Windows 7 (recommended), Windows Vista, Windows XP

Input Structure: Can directly open spreadsheet, text, and database files

Overview

STATISTICA is user friendly, while still allowing for significant customization and functionality. The base package computes practically all common descriptive statistics and can produce a wide variety of customizable graphics. The base software includes graphics tools along with the following modules:

Add-Ins Available

STATISTICA also has a number of add-in packages and modules that can enhance the functionality of the base software package.

Ease of Use and Data Import

STATISTICA is designed as a user-friendly software package. For additional help with the program, you can watch a wide variety of training videos on the website or take a training seminar for STATISTICA basics or advanced topics. 

STATISTICA can directly open many data types including databases, spreadsheets, and text files. Using STATISTICA Query and Visual Basic, you can easily query and import or export data from databases for statistical analysis. Output data sheets and plots can be sent to workbooks, STATISTICA reports, or Microsoft Word. STATISTICA can also coordinate with the free software package R, allowing you to run R scripts and algorithms from STATISTICA and customize outputs and graphics directly in STATISTICA. You can also record a macro of process steps, allowing for easy reproduction and duplication of analysis.

Types of Distributions

STATISTICA contains a distribution fitting option which directly compares the distribution of data to a wide variety of distributions. Available distributions for fitting include normal, rectangular, exponential, gammaA gamma distribution or data set. A parametric unimodal distribution model commonly applied to groundwater data where the data set is left skewed and tied to zero. Very similar to Weibull and lognormal distributions; differences are in their tail behavior, and the gamma density has the second longest tail where its coefficient of variation is less than 1 (Unified Guidance; Gilbert 1987; Silva and Lisboa 2007)., lognormalA dataset that is not normally distributed (symmetric bell-shaped curve) but that can be transformed using a natural logarithm so that the data set can be evaluated using a normal-theory test (Unified Guidance)., chi-squared, Weibull, Compertz, Binomial, Poisson, geometric, or Bernoulli distributions. Once a distribution has been fit, you can evaluate the fit using a variety of tests and plots.

Additional distribution fitting options are available in the STATISTICA Process Analysis add-in, including the option to calculate the maximum-likelihood parameter. The STATISTICA Advanced Linear/Non-Linear Model add-in allows you to fit data to complex, custom-defined functions.

Visualization

STATISTICA has a wide variety of plotting capabilities in both 2-D and 3-D. Plotting options in the base package include box plots, 2-D and 3-D histograms, bivariate distributions, 2-D and 3-D scatter plots, normal, half-normal, and detrended probability plots, quartile-quartile plots, probability-probability plotsGraphical presentation of quantiles or z-scores plotted on the y-axis and, for example, concentration measurement in increasing magnitude plotted on the x-axis. A typical exploratory data analysis tool to identify departures from normality, outliers and skewness (Unified Guidance)., contour plots, nonsmoothed surfaces, and icons. You can zoom in on portions of the graphs, which can be useful when visualizing larger data sets and when producing cross-section slices from 3-D graphics. STATISTICA also has the option of plotting multiple-subset scatter plotsGraphical representation of multiple observations from a single point used to illustrate the relationship between two or more variables. An example would be concentrations of one chemical on the x-axis and a second chemical on the y-axis. They are a typical exploratory data analysis tool to identify linear versus nonlinear relationships between variables (Unified Guidance). and categorized scatter plots. The program provides many options to customize and format figures and tables for reports and presentations.

Primary Uses for Groundwater Data Analysis

The STATISTICA base package and add-ins include a wide variety of customizable graphics, which are ideal for use in groundwater data analysis. These plots can be used to analyze distribution, illustrate general trends, and support conclusions derived from hypothesis testing and descriptive statistics. STATISTICA is also well known for its strong data mining add-in, which allows you to rapidly analyze large data files and data sets.

Benefits

Limitations and Data Requirements

 

Publication Date: December 2013

Permission is granted to refer to or quote from this publication with the customary acknowledgment of the source (see suggested citation and disclaimer).

 

This web site is owned by ITRC.

1250 H Street, NW • Suite 850 • Washington, DC 20005

(202) 266-4933 • Email: [email protected]

Terms of Service, Privacy Policy, and Usage Policy

 

ITRC is sponsored by the Environmental Council of the States.